Restless Bandits, Linear Programming Relaxations, and a Primal-Dual Index Heuristic
نویسندگان
چکیده
We develop a mathematical programming approach for the classical PSPACE hard restless bandit problem in stochastic optimization. We introduce a hierarchy of n (where n is the number of bandits) increasingly stronger linear programming relaxations, the last of which is exact and corresponds to the (exponential size) formulation of the problem as a Markov decision chain, while the other relaxations provide bounds and are e ciently computed. We also propose a priority-index heuristic scheduling policy from the solution to the rst-order relaxation, where the indices are de ned in terms of optimal dual variables. In this way we propose a policy and a suboptimality guarantee. We report results of computational experiments that suggest that the proposed heuristic policy is nearly optimal. Moreover, the second-order relaxation is found to provide strong bounds on the optimal value.
منابع مشابه
Restless Bandits, Linear Programming Relaxations and a Primal-Dual Heuristic
We propose a mathematical programming approach for the classical PSPACE hard problem of n restless bandits in stochastic optimization. We introduce a series of n increasingly stronger linear programming relaxations, the last of which is exact and corresponds to the formulation of the problem as a Markov decision process that has exponential size, while other relaxations provide bounds and are e...
متن کاملSome Duality Results in Grey Linear Programming Problem
Different approaches are presented to address the uncertainty of data and appropriate description of uncertain parameters of linear programming models. One of them is to use the grey systems theory in modeling such problem. Especially, recently, grey linear programming has attracted many researchers. In this paper, a kind of linear programming with grey coefficients is discussed. Introducing th...
متن کاملABS Solution of equations of second kind and application to the primal-dual interior point method for linear programming
Abstract We consider an application of the ABS procedure to the linear systems arising from the primal-dual interior point methods where Newton method is used to compute path to the solution. When approaching the solution the linear system, which has the form of normal equations of the second kind, becomes more and more ill conditioned. We show how the use of the Huang algorithm in the ABS cl...
متن کاملA Linear Programming Relaxation and a Heuristic for the Restless Bandit Problem with General Switching Costs
We extend a relaxation technique due to Bertsimas and Niño-Mora for the restless bandit problem to the case where arbitrary costs penalize switching between the bandits. We also construct a one-step lookahead policy using the solution of the relaxation. Computational experiments and a bound for approximate dynamic programming provide some empirical support for the heuristic.
متن کاملOn energy-efficient TCP traffic over wireless cooperative relaying networks
Cooperative relay is a promising technology to provide better end-to-end performance for wireless communication through spatial diversity. Most of previous study on relay networks concentrates on lower layer performance. However, how to design the relay network for standard transport control protocol (TCP), which is very important for application layer service, is rarely investigated. We focus ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Operations Research
دوره 48 شماره
صفحات -
تاریخ انتشار 2000